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' Consider an i.i.d. sample X*, X 2 , . . . , X* from a location-scale family, and as- 

C/3 ■ sume that the only available observations consist of the partial maxima (or 

minima) sequence, X* :1 , X 2:2 , . . . , X*. n , where X*. - = max{A^, . . . , X*}. This 
kind of truncation appears in several circumstances, including best perfor- 
mances in athletics events. In the case of partial maxima, the form of the 
BLUEs (best linear unbiased estimators) is quite similar to the form of the 
well-known Lloyd's (1952, Least-squares estimation of location and scale pa- 
■ rameters using order statistics, Biometrika, vol. 39, pp. 88-95) BLUEs, based 

on (the sufficient sample of) order statistics, but, in contrast to the classi- 
cal case, their consistency is no longer obvious. The present paper is mainly 
concerned with the scale parameter, showing that the variance of the partial 
. maxima BLUE is at most of order 0(1/ log n) , for a wide class of distributions. 

O . 
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x. 

1 Introduction 

There are several situations where the ordered random sample, 

K.n < K.n < • • ■ < K.n, (LI) 

corresponding to the i.i.d. random sample, X*, X 2 , ■ ■ ■ , X*, is not fully reported, 
because the values of interest are the higher (or lower), up-to-the-present, record 
values based on the initial sample, i.e., the partial maxima (or minima) sequence 

K.1 < K.2 < ' ' • < K:n, (1-2) 

where X*.- = max{Xj", . . . , X*}. A situation of this kind commonly appears in 
athletics, when only the best performances are recorded. 

Through this article we assume that the i.i.d. data arise from a location-scale 
family, 

{F((--e 1 )/e 2 y,9 1 eR,e 2 >o}, 



where the d.f. F(-) is free of parameters and has finite, non-zero variance (so that F 
is non-degenerate), and we consider the partial maxima BLUE (best linear unbiased 
estimator) for both parameters 9\ and 82- This consideration is along the lines of the 
classical Lloyd's (1952) BLUEs, the only difference being that the linear estimators 
are now based on the "insufficient sample" (1.2), rather than (1.1), and this fact 
implies a substantial reduction on the available information. Tryfos and Blackmore 
(1985) used this kind of data to predict future records in athletic events, Samaniego 
and Whitaker (1986, 1988) estimated the population characteristics, while Hofmann 
and Nagaraja (2003) investigated the amount of Fisher Information contained in such 
data; see also Arnold, Balakrisnan & Nagaraja (1998, Section 5.9). 

A natural question concerns the consistency of the resulting BLUEs, since too 
much lack of information presumably would result to inconsistency (see at the end of 
Section 6). Thus, our main focus is on conditions guaranteeing consistency, and the 
main result shows that this is indeed the case for the scale parameter BLUE from a 
wide class of distributions. Specifically, it is shown that the variance of the BLUE is 
at most of order 0(l/logn), when F(x) has a log-concave density f(x) and satisfies 
the Von Mises-type condition (5.11) or (6.1) (cf. Galambos (1978)) on the right end- 
point of its support (Theorem 5.2, Corollary 6.1). The result is applicable to several 
commonly used distributions, like the Power distribution (Uniform), the Weibull 
(Exponential), the Pareto, the Negative Exponential, the Logistic, the Extreme Value 
(Gumbel) and the Normal (see section 6). A consistency result for the partial maxima 
BLUE of the location parameter would be desirable to be included here, but it seems 
that the proposed technique (based on partial maxima spacings, section 4) does not 
suffice for deriving it. Therefore, the consistency for the location parameter remains 
an open problem in general, and it is just highlighted by a particular application to 
the Uniform location-scale family (section 3). 

The proof of the main result depends on the fact that, under mild conditions, 
the partial maxima spacings have non-positive correlation. The class of distributions 
having this property is called NCP (negative correlation for partial maxima spacings). 
It is shown here that any log-concave distribution with finite variance belongs to 
NCP (Theorem 4.2). In particular, if a distribution function has a density which 
is either log-concave or non-increasing then it is a member of NCP. For ordinary 
spacings, similar sufficient conditions were shown by Sarkadi (1985) and Bai, Sarkar 
& Wang (1997) - see also David and Nagaraja (2003, pp. 187-188), Burkschat (2009), 
Theorem 3.5 - and will be referred as "S/BSW-type conditions". 

In every experiment where the i.i.d. observations arise in a sequential manner, 
the partial maxima data describe the best performances in a natural way, as the 
experiment goes on, in contrast to the first n record values, R±,R2, ■ ■ ■ ,R n , which 
are obtained from an inverse sampling scheme - see, e.g., Berger and Gulati (2001). 
Due to the very rare appearance of records, in the latter case it is implicitly assumed 
that the sample size is, roughly, e n . This has a similar effect in the partial maxima 
setup, since the number of different values are about logn, for large sample size n. 
Clearly, the total amount of information in the partial maxima sample is the same 
as that given by the (few) record values augmented by record times. The essential 
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difference of these models (records / partial maxima) in statistical applications is 
highlighted, e.g., in Tryfos and Blackmore (1985), Samaniego and Whitaker (1986, 
1988), Smith (1988), Berger and Gulati (2001) and Hofmann and Nagaraja (2003) - 
see also Arnold, Balakrishnan & Nagaraja (1998, Chapter 5). 

2 Linear estimators based on partial maxima 

Consider the random sample X*,X%, . . . ,X* from F((x — 9i)/9 2 ) and the corre- 
sponding partial maxima sample X* :1 ^ X 2 . 2 ^ • • • ^ X*. n (9 1 e K is the location 
parameter and 9 2 > is the scale parameter; both parameters are unknown). Let 
also Xi, X 2 , . . . , X n and X 1:1 ^ X 2: 2 ^ • • • ^ X n:n be the corresponding samples from 
the completely specified d.f. F(x), that generates the location-scale family. Since 

(Xl :1 ,X 2 . 2 , . . . , X*. n )' = (9 1 + 9 2 Xi : i, 9\ + 9 2 X 2 -, 2 , . . . , 9\ + 9 2 X n:n )' , 

a linear estimator based on partial maxima has the form 

n n n 

i=i i=i i=i 

for some constants q, % — 1, 2, . . . , n. 

Let X = (Xi-i, X 2:2 , . . . , X n:n )' be the random vector of partial maxima from the 
known d.f. F(x), and use the notation 

fjt=E[X], S=D[X] and£=E[XX'], (2.1) 

where D [£] denotes the dispersion matrix of any random vector £. Clearly, 

S = E - nv', £ > 0, E > 0. 

The linear estimator L is called BLUE for # fc (k = 1,2) if it is unbiased for 9 k and 
its variance is minimal, while it is called BLIE (best linear invariant estimator) for 
9k if it is invariant for 9^ and its mean squared error, MSE[L] = E[L — 9k} 2 , is 
minimal. Here "invariance" is understood in the sense of location-scale invariance as 
it is defined, e.g., in Shao (2005, p. xix). 

Using the above notation it is easy to verify the following formulae for the BLUEs 
and their variances. They are the partial maxima analogues of Lloyd's (1952) estima- 
tors and, in the case of partial minima, have been obtained by Tryfos and Blackmore 
(1985), using least squares. A proof is attached here for easy reference. 

Proposition 2.1 The partial maxima BLUEs for 9\ and for 9 2 are, respectively, 

Li = -^/xTX* and L 2 = ^l'VX*, (2.2) 

where X* = (X* :1 , X* :2 , . . . , X* n .J, A = (l'S" 1 !)^" 1 /*) - (l'ST 1 /*) 2 > 0, 1 = 
(1, 1, . . . , 1)' G R n and T = S -1 (l/i' - fil')^- 1 . The corresponding variances are 

Vax[Li] = ^(/i'S-V)^ ^ Var[L 2 ] = ^-(l'S- 1 !)^. (2.3) 
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Proof: Let c = (ci,c 2 , . . . , c n )' e K n and L = c'X*. Since E[L] = (c'l)0i + (c»0 2 , 
L is unbiased for #1 iff c'l = 1 and c'/x = 0, while it is unbiased for 9 2 iff c'l = and 
c'/it = f. Since Var[L] = (c'£c)#|, a simple minimization argument for c'Sc with 
respect to c, using Lagrange multipliers, yields the expressions (2.2) and (2.3). □ 

Similarly, one can derive the partial maxima version of Mann's (f 969) best linear 
invariant estimators (BLIEs), as follows. 

Proposition 2.2 The partial maxima BLIEs for 9\ and for 9 2 are, respectively, 

_ l'E^X* , m l'GX* . . 

Ti = = — and T 2 = =-, 2.4 

where X* and 1 are as in Proposition 2.1 and G = E~ l {lii' - /xl'j-ET 1 . The 

corresponding mean squared errors are 

MSE [Ti] = and M SE[T 2 ] = (l - ^7^4) °l ( 2 - 5 ) 

w/iere D = {VE^l^u'E^n) ~ {VE' 1 ^) 2 > 0. 

Proof: Let L = L{X*) = c'X* be an arbitrary linear statistic. Since L(bX* + al) = 
a(c'l) + bL(X*) for arbitrary a E R and 6 > 0, it follows that L is invariant for ^ x 
iff c'l = I while it is invariant for 9 2 iff c'l = 0. Both (2.4) and (2.5) now follow 
by a simple minimization argument, since in the first case we have to minimize the 
mean squared error E[L — 9\\ 2 = (c'Ec)9\ under c'l = 1, while in the second one, 
we have to minimize the mean squared error E[L — 9 2 ] 2 = (c'Ec — 2/x'c+ 1)#| under 
c'l = 0. □ 

The above formulae (2.2)-(2.5) are well-known for order statistics and records 
- see David (1981, Chapter 6), Arnold, Balakrishnan & Nagaraja (1992, Chapter 
7; 1998, Chapter 5), David and Nagaraja (2003, Chapter 8). In the present setup, 
however, the meaning of X*, X, /x, S and E is completely different. In the case 
of order statistics, for example, the vector /x, which is the mean vector of the order 
statistics X = (X 1:n ,X 2:n , . . . ,X n:n )' from the known distribution F(x), depends on 
the sample size n, in the sense that the components of the vector /x completely change 
with n. In the present case of partial maxima, the first n entries of the vector /j,, which 
is the mean vector of the partial maxima X = (X^i, X 2:2 , . . . , X n:n )' from the known 
distribution F(x), remain constant for all sample sizes n' greater than or equal to n. 
Similar observations apply for the matrices S and E. This fact seems to be quite 
helpful for the construction of tables giving the means, variances and covariances 
of partial maxima for samples up to a size n. It should be noted, however, that 
even when F(x) is absolutely continuous with density f(x) (as is usually the case for 
location-scale families), the joint distribution of (X i:i ,Xj.j) has a singular part, since 
P[X i: j = Xj-j] = i/j > 0, i < j. Nevertheless, there exist simple expectation and 
covariance formulae (Lemma 2.2). 
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As in the order statistics setup, the actual application of formulae (2.2) and (2.4) 
requires closed forms for fi and E, and also to invert the n x n matrix S. This 
can be done only for very particular distributions (see next section, where we apply 
the results to the Uniform distribution). Therefore, numerical methods should be 
applied in general. This, however, has a theoretical cost: It is not a trivial fact to 
verify consistency of the estimators, even in the classical case of order statistics. The 
main purpose of this article is in verifying consistency for the partial maxima BLUEs. 
Surprisingly, it seems that a solution of this problem is not well-known, at least to 
our knowledge, even for the classical BLUEs based on order statistics. However, 
even if the result of the following lemma is known, its proof has an independent 
interest, because it proposes alternative (to BLUEs) n _1 / 2 -consistent unbiased linear 
estimators and provides the intuition for the derivation of the main result of the 
present article. 

Lemma 2.1 The classical BLUEs of 9i and 9 2 , based on order statistics from a 
location-scale family, created by a distribution F(x) with finite non-zero variance, 
are consistent. Moreover, their variance is at most of order 0(1/ n) . 

Proof: Let X* = (X* 1:n , X* :n , . . . , X* : J and X = (X 1:n: X 2:n , . . . , X n:n )> be the 

ordered samples from F((x — 9i)/9 2 ) and F(x), respectively, so that X* = 9±1 + 9 2 X. 
Also write X{, X 2) . . . ,X* and Xi, X 2 , . . . , X n for the corresponding i.i.d. samples. 
We consider the linear estimators 



1 n 

S 1 = X* = -YX* = 9 1 + 9 2 X 

n 



and 



n n q n n 

s? = —^YY\x:-x:\ = ^-^YY\x 3 -x> 



l i 1 1 



nin-V)^^ 3 1 n(n-l) . 

i.e., Si is the sample mean and S 2 is a multiple of Gini's statistic. Observe that 
both Si and S 2 are linear estimators in order statistics. [In particular, S 2 can be 
written as S 2 = 4(n(n - l))" 1 £™ =1 (i - (n + l)/2)X* n \ Clearly, E(^) = 9 ± + 9 2 fi , 
E (S 2 ) = 9 2 r , where /i is the mean, E(Xx), of the distribution F(x) and r is the 
positive finite parameter E|Xx — X 2 \. Since F is known, both /i e K and r > are 
known constants, and we can construct the linear estimators TJ\ — S\ — (^o/ T o)S 2 and 
U 2 = S 2 /t . Obviously, E(f4) = 9^, k = 1,2, and both Ui, U 2 are linear estimators 
of the form T n = (1/n) Yli=i n )X*m with \6(i, n)\ uniformly bounded for all i and 
n. If <7q is the (assumed finite) variance of F(x), it follows that 



-, n n 

Var[T n ] < -^^|^,n)||5(j,n)|Cov(X* n ,X; : J 

71 i=l j=l 

^ ±(m8*\5(i,n)\) Var(X* ri + X* n + --- + X, 



2 

,2_2 



max \5(i,n)\ } 9 2 ai = 0(n ) — > 0, as n — > oo, 
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showing that Var(CTfc) — > 0, and thus Uk is consistent for 6k, k = 1,2. Since Lk has 
minimum variance among all linear unbiased estimators, it follows that Var(Lfe) ^ 
Var(£4) ^ 0{l/n), and the result follows. □ 

The above lemma implies that the mean squared error of the BLIEs, based on 
order statistics, is at most of order 0(l/n), since they have smaller mean squared 
error than the BLUEs, and thus they are also consistent. More important is the fact 
that, with the technique used in Lemma 2.1, one can avoid all computations involving 
means, variances and covariances of order statistics, and it does not need to invert 
any matrix, in order to prove consistency (and in order to obtain C^ttT 1 ) -consistent 
estimators). Arguments of similar kind will be applied in section 5, when the problem 
of consistency for the partial maxima BLUE of 6 2 will be taken under consideration. 

We now turn in the partial maxima case. Since actual application of partial 
maxima BLUEs and BLIEs requires the computation of the first two moments of X = 
(Xi-i, X 2: 2, ■ ■ ■ , X n . n )' in terms of the completely specified d.f. F(x), the following 
formulae are to be mentioned here (cf. Jones and Balakrishnan (2002)). 

Lemma 2.2 Let X^i ^ X 2 - 2 ^ • • • ^ X n:n be the partial maxima sequence based on 
an arbitrary d.f. F(x). 

(i) Fori ^ j, the joint d.f. of (X i:i , Xj : j) is 

F Xi:i , Xj:j (x,y) - ^ F i {x)Fj -i {y) if x<y (2-6) 

(ii) // F has finite first moment, then 

poo rO 

Hi = E[X i:i \ = / (1 -F^x)) dx- F\x)dx (2.7) 

JO J -oo 

is finite for all i. 

(iii) If F has finite second moment, then 



an = Cov[X i:i ,X j:j ] = / / F\x){Fi-\x) + F^(y))(l - F\y)) dy dx 

J J —oo<x<y<oo 

(2.8) 

is finite and non-negative for all i ^ j. In particular, 



a u = of = Var[X J:J ] = 2 F\x){\ - F\y)) dy dx. (2.9) 

J J— oo<x<y<oo 

Proof: (i) is trivial and (ii) is well-known, (iii) follows from Hoeffding's identity 

/oo roo 
/ (F x , Y (x,y)-F x (x)F Y (y)) dy dx 
-oo J —oo 

(see Hoeffding (1940), Lehmann (1966), Jones and Balakrishnan (2002), among oth- 
ers), applied to (X, Y) = (X i:i ,Xj : j) with joint d.f. given by (2.6) and marginals 
F\x) and F^(y). □ 
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Formulae (2.7)-(2.9) enable the computation of means, variances and covariances 
of partial maxima, even in the case where the distribution F does not have a density. 
Tryfos and Blackmore (1985) obtained an expression for the covariance of partial 
minima involving means and covariances of order statistics from lower sample sizes. 



3 A tractable case: the Uniform location-scale fam-l 

iiy 

Let xt,x;,...,x' ~ £/(»,, o, + <y, so that (x' 1:1 , x' 2:2 , . . . , x'.j i ea + e 2 x, 

where X = (Xi : i,X 2: 2, • • • , X n:n )' is the partial maxima sample from the standard 
Uniform distribution. Simple calculations, using (2.7)-(2.9), show that the mean 
vector n = (/j,j) and the dispersion matrix £ = (o^-) of X are given by (see also 
Tryfos and Blackmore (1985), eq. (3.1)) 



i + 1 



and <x 



(i + 1)0' + 1)0' + 2) 



for 1 ^ i ^ j ^ n. 



Therefore, XI is a patterned matrix of the form = a^bj for % $C j, and thus, 
its inverse is tridiagonal; see Graybill (1969, Chapter 8), Arnold, Balakrishnan & 
Nagaraja (1992, Lemma 7.5.1). Specifically, 



/ 7i Si 
-8i 72 
-5 2 















S 2 

73 












7n-l 









\ 



where 



li 



4( i + l)3(, + 2 )2 
(2i + l)(2i + 3) ' 

and 7„ = 



(i + l)(i + 2) 2 (i + 3) 
2^ + 3 
(n + l) 2 (n + 2) 2 
2n + 1 



-8n-l 
In ) 



1,2, 



,n - 1, 



Setting a{n) = l'S _1 l, 6(n) = (1 - - /*) and c(n) = (1 - m)'^" 1 !, we get 



b(n) 
c(n) 



(n + l) 2 (n + 2) 2 
2n + 1 

2 n-1 



n-1 



+ + 2) 2 (32 + l) 2 2 



(2i + l)(2i + 3) 



n + 2 2 ^ 2 -12 + 2 1 „ . 

2> ■+ T7 + = -logn + o logn , 

^ (22 + l)(22 + 3) 2 & v & 



2n+ 1 

(n + l)(n + 2) 2 
2n + 1 



n— 1 

E 

i=i 



(2' + 2)(42' 2 + 72'+ 1) 

(2i + l)(2i + 3) 



= n + o(n). 
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Applying (2.3) we obtain 



w rr i + - 2c(n „ / 2 / 1 , , /r , 

Var Li = / vw \ — TT^ 1 ^ = ; + o : ^ and 

1 J a{n)b(n) - c 2 (n) 1 \logn \logn '• ~ 



a(n) „ / 2 / 1 



Var t L 2] = 7 "V"' 2 , = ( r^- + ( r^— ) °l- 

a(n)b(n) — c 2 (n) \logn \\ogn J J 

The preceding computation shows that, for the Uniform location-scale family, the 
partial maxima BLUEs are consistent for both the location and the scale parameters, 
since their variance goes to zero with the speed of 2/logn. This fact, as expected, 
contradicts the behavior of the ordinary order statistics BLUEs, where the speed of 
convergence is of order n~ 2 for the variance of both Lloyd's estimators. However, the 
comparison is quite unfair here, since Lloyd's estimators are based on the complete 
sufficient statistic {X{. n , X* :n ), and thus the variance of order statistics BLUE is 
minimal among all unbiased estimators. 

On the other hand we should emphasize that, under the same model, the BLUEs 
(and the BLIEs) based solely on the first n upper records are not even consistent. 
In fact, the variance of both BLUEs converges to 0f/3, and the MSE of both BLIEs 
approaches 0f/4, as n — > oo; see Arnold, Balakrishnan & Nagaraja (1998, Examples 
5.3.7 and 5.4.3). 



4 Scale estimation and partial maxima spacings 

In the classical order statistics setup, Balakrishnan and Papadatos (2002) observed 
that the computation of BLUE (and BLIE) of the scale parameter is simplified con- 
siderably if one uses spacings instead of order statistics - cf. Sarkadi (1985). Their 
observation applies here too, and simplifies the form of the partial maxima BLUE 
(and BLIE). 

Specifically, define the partial maxima spacings as Z* = X* +1 . i+1 — X* :i ^ and 
Zi = X i+1:i+1 - X i:i ^ 0, for i = 1, 2, . . . , n - 1, and let Z* = (Z*, Z*, .' . . , Z*_J 

and Z = (Zi, Z 2 , . . . , Z n _i)'. Clearly, Z* = 9 2 Z, and any unbiased (or even in- 
variant) linear estimator of 9 2 based on the partial maxima sample, L = c! X* , 
should necessarily satisfy X^=i Ci = ( see ^ ne P ro °f s °f Propositions 2.1 and 2.2). 
Therefore, L can be expressed as a linear function on Z* J s, L = b'Z*, where now 
b = (&x, b 2 , ■ ■ ■ , frn-i)' ^ IR™" 1 - Consider the mean vector m = E[Z], the dispersion 
matrix S = D[Z], and the second moment matrix D = E[ZZ'] of Z. Clearly, 

5 = D — mm', S > 0, D > 0, and the vector m and the matrices S and D are of 
order n — 1. Using exactly the same arguments as in Balakrishnan and Papadatos 
(2002), it is easy to verify the following. 

Proposition 4.1 The partial maxima BLUE of 9 2 , given in Proposition 2.1, has the 
alternative form 

m' S^ 1 Z* 9 2 
L2= , with Var[L 2 ]= \ , (4.1) 

rn'S m rn'S m 
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while the corresponding BLIE, given in Proposition 2.2, has the alternative form 

T 2 = m'D^Z*, with MSE[T 2 ] = (1 - m'D _1 m)^. (4.2) 

It should be noted that, in general, the non-negativity of the BLUE of 62 does 
not follow automatically, even for order statistics. In the order statistics setup, this 
problem was posed by Arnold, Balakrishnan & Nagaraja (1992), and the best known 
result, till now, is the one given by Bai, Sarkar & Wang (1997) and Sarkadi (1985). 
Even after the slight improvement, given by Balakrishnan and Papadatos (2002) and 
by Burkschat (2009), the general case remains unsolved. The same question (of non- 
negativity of the BLUE) arises in the partial maxima setup, and the following theorem 
provides a partial positive answer. We omit the proof, since it again follows by a 
straightforward application of the arguments given in Balakrishnan and Papadatos 
(2002). 

Theorem 4.1 (i) There exists a constant a = a n (F), < a < 1, depending only on 
the sample size n and the d.f. F(x) (i.e., a is free of the parameters 6\ and 9 2 ), such 
that T 2 = aL 2 . This constant is given by a = m'D l_1 m = m'S l m/ (1 + m'S l m) . 
(ii) // either n = 2 or the (free of parameters) d.f. F{x) is such that 

Cov [Zi, Zj] ^ for all i ^ j , i,j — 1, . . . , n — 1, (4.3) 

then the partial maxima BLUE (and BLIE) of 62 is non-negative. 

Note that, as in order statistics, the non-negativity of L 2 is equivalent to the fact 
that the vector S _1 m (or, equivalently, the vector D~ l m) has non-negative entries; 
see Balakrishnan and Papadatos (2002) and Sarkadi (1985). 

Since it is important to know whether (4.3) holds, in the sequel we shall make 
use of the following definition. 

Definition 4.1 A d.f. F(x) with finite second moment (or the corresponding density 
f(x), if exists) 

(i) belongs to the class NCS (negatively correlated spacings) if its order statistics 
have negatively correlated spacings for all sample sizes n ^ 2. 

(ii) belongs to the class NCP if it has negatively correlated partial maxima spacings, 
i.e., if (4.3) holds for all n ^ 2. 

An important result by Bai, Sarkar & Wang (1997) states that a log-concave den- 
sity f(x) with finite variance belongs to NCS - cf. Sarkadi (1985). We call this suf- 
ficient condition as the S/BSW-condition (for ordinary spacings). Burkschat (2009, 
Theorem 3.5) showed an extended S/BSW-condition, under which the log-concavity 
of both F and 1 — F suffice for the NCS class. Due to the existence of simple formu- 
lae like (4.1) and (4.2), the NCS and NCP classes provide useful tools in verifying 
consistency for the scale estimator, as well as, non-negativity. Our purpose is to 
prove an S/BSW-type condition for partial maxima (see Theorem 4.2, Corollary 4.1, 
below). To this end, we first state Lemma 4.1, that will be used in the sequel. 

Only through the rest of the present section, we shall use the notation = 
max{Xi, . . . , Xfc}, for any integer k ^ 1. 
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Lemma 4.1 Fix two integers i, j , with 1 ^ % < j, and suppose that the i.i.d. r.v. 's 
Xi,X2, . . . have a common d.f. F(x). Let I (expression) denoting the indicator func- 
tion taking the value 1, if the expression holds true, and otherwise. 

(i) The conditional d.f. of Yj +i given Yj is 

p < y\ ^ = { F ° {y ^ %S %! v y %\ = F (y)i(y > y 3 ), y e r. 

If, in addition, % + 1 < j , then the following property (which is an immediate conse- 
quence of the Markovian character of the extremal process) holds: 

P [Y j+1 < y\ Y i+1 , Yj] = P [Y j+1 *C y\ Yj], y E R. 

(ii) The conditional d.f. of Y-i given Y i+i is 

{ Ft(x) if x<Y , 

P[Yi<x\Y i+1 ] = 1 £; =0 F^ + i)F^(y m -)' 

I 1, if x ^ Y i+l 

FHx) 

= I(x > Y i+1 ) + I(x < Y i+1 ) — : — , x eR. 

If in addition, i + 1 < j , then the following property (which is again an immediate 
consequence of the Markovian character of the extremal process) holds: 

P [Yi < a; | Y i+1 , Y 3 \ = P [YJ < x\ Y i+1 ], x G R. 

(iii) Given (Y i+1 ,Yj), the random variables Y i and Yj +1 are independent. 

We omit the proof since the assertions are simple by-products of the Markovian 
character of the process {Yfc, k ^ 1}, which can be embedded in a continuous time 
extremal process {Y(t), t > 0}; see Resnick (1987, Chapter 4). We merely note that 
a version of the Radon-Nikodym derivative of F' l+1 w.r.t. F is given by 

h, +l {x) = d -^^ = j^F^x)F^(x-), xeR, (4.4) 

which is equal to (i + l)F l (x) only if a; is a continuity point of F. To see this, it 
suffices to verify the identity 

dF i+l (x) = / h i+1 (x) dF(x) for all Borel sets BC1 (4.5) 

B JB 

Now (4.5) is proved as follows: 
/ dF l+l = P (Y i+ i e B) 

J B 
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i+1 



J'=l 
i+1 



i+1 



^i+1 



k=i 



j=l lsCfc 1 <---<fc J sgt+l 

X s < X fcl for s ^ {fci, . . . , fcj}] 

i+1 



= E 

j'=i 

i+1 

= E 

j'=i 



i + 1 
J 

i + 1 
J 

i+1 



p[x x = • • • = Xj -,e B,x j+1 <x u .. .,x i+1 < x l ] 



E 



i+1 



l[l(X k = x) ) ( J] 7(X fc <x) 

k=j+l 



\k=2 



dF(x) 



E 

0=1 



i + l 



(F(x) - F(x—)) j ~ 1 F i+1 ~ j (x—) dF(x) 



h i+1 (x)dF(x), 



where we used the identity Yl*j=i C"j 1 )(^ ~~ a )' ? J — X]j=o^ a * ^ a ^ °- 

We can now show the main result of this section, which presents an S/BSW-type 
condition for the partial maxima spacings. 

Theorem 4.2 Assume that the d.f. F(x), with finite second moment, is a log-concave 
distribution (in the sense that \ogF(x) is a concave function in J, where J = {x e 
R : < < 1}), ant/ /ias not an atom at its right end-point, oo(F) = inf{x G R : 

F(x) = 1}. Then, F(x) belongs to the class NCP ; i.e., (4.3) holds for all n ^ 2. 

Proof: For arbitrary r.v.'s X ^ xq > — oo and F ^ y < +oo, with respective d.f.'s 
Fx, Fy, we have 



E 



[X]=x + (1 - F x (t)) dt and E[E]=y -/ MO 

«/ a;o J —oo 



dt 



(4.6) 



(cf. Papadatos (2001), Jones and Balakrishnan (2002)). Assume that % < j. By 
Lemma 4.1 (i) and (4.6) applied to Fx = Fy j+1 \Yj, it follows that 

roc 

E[y i+1 | = e[y;-+i| ^] =^-+ / (i-f(*)) dt, w. P . 1. 

Similarly, by Lemma 4.1(h) and (4.6) applied to F Y = F Yi \Y i+1 , we conclude that 

"Y i+1 



E[Ej| Y i+l ,Yj] = E[Ej| Y i+ i\ = Y i+i 



1 



hi+i(Y i+ i) 



F\t) dt, w.p. 1, 



where hi + \ is given by (4.4). Note that F is continuous on J, since it is log-concave 
there, and thus, hi+i(x) — (i + l)F l (:r) for x <E J. If is finite, F(x) is also 
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continuous at x — u)(F), by assumption. On the other hand, if a(F) = inf{x : F(x) > 
0} is finite, F can be discontinuous at x — a(F), but in this case, hi + \(a(F)) = 
F\a(F)) > 0; see (4.4). Thus, in all cases, h i+1 (Y i+1 ) > w.p. 1. 

By conditional independence of Y{ and Yj +i (Lemma 4.1 (iii)), we have 

Cw(Zi, Zj\ Y i+1 , Yj) = Cov(Y i+1 - Yi, Y j+1 - Yj\ Y i+1 , Yj) 

= -Cov (Y i: Y j+1 \Y i+1: Yj) = 0, w.p. 1, 

so that E[Cov(Zj, Z 3 \ Yj + i,Y})] = 0, and thus, 

CovfaZj] = Cov[E(Z,| Y l+1 ,Y 3 ),1E(Z 3 \ Y i+1 , Yj)] + EfCov^, Z 3 \ Y i+1 ,Yj)\ 

= Cov[E(r i+1 -r t | y 1+1 ,y 3 )} 
= Cov[r m - E(y,| Y i+ i, Yj), E(y j+1 | y +1 ,y 3 ) - y 3 ] 

= Cov[g(Y i+1 ),h(Yj)l (4.7) 

where 

\ / F i (t) dt x > a(F) f°° 

(i + l)F*(x) W ' 1 h h(x) = I (1 - F(t)) di. 

0, otherwise, 1 

Obviously, ft-(x) is non-increasing. On the other hand, g(x) is non- decreasing in R. 
This can be shown as follows. First observe that g(a(F)) = if a(F) is finite, 
while g(x) > for x > a(F). Next observe that g is finite and continuous at 
x = u(F) if uj(F) is finite, as follows by the assumed continuity of F at x = oo(F) 
and the fact that F has finite variance. Finally, observe that F l (x), a product of 
log-concave functions, is also log-concave in J. Therefore, for arbitrary y G J, the 
function d(x) = F l (x)/ f^F^tfdt, x G (—oo,y) H J, is a probability density, and 
thus, it is a log-concave density with support (— oo,y) fl J. By Prekopa (1973) or 
Dasgupta and Sarkar (1982) it follows that the corresponding distribution function, 
D(x) = ^d{t)dt = f^F^tjdt/ f^F^tjdt, x G (-oo,y) H J, is a log-concave 
distribution, and since y is arbitrary, H(x) = j* co F l {t)dt is a log-concave function, 
for x G J. Since F is continuous in J, this is equivalent to the fact that the function 

H'{x) _ F\x) 

m~ r^mtf x ' 

is non-increasing, so that g(x) = H(x)/((i + l)iF(x)) is non- decreasing in J. 

The desired result follows from (4.7), because the r.v.'s Y^+i and Yj are posi- 
tively quadrant dependent (PQD - Lehmann (1966)), since it is readily verified that 
FY i+1 ,Yj{ x iy) ^ F Yi+1 (x)F Yj (y) for all x and y (Lemma 2.2(i)). This completes the 
proof. □ 

The restriction F(x) — > 1 as x — > oo(F) cannot be removed from the theorem. 
Indeed, the function 

{0 x ^ 0, 
x/4, ^ x < 1, 
1, x>l, 
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is a log-concave distribution in J = (a(F),u(F)) = (0,1), for which Cov[Zi, Z2] = 
jofion > 0. The function g, used in the proof, is given by 



M | max {°'FTiy}' 
[ i+i + (i+i) 2 ^' 

and it is not monotonic. 

Since the family of densities with log-concave distributions contains both families 
of log-concave and non-increasing densities (see, e.g., Prekopa (1973), Dasgupta and 
Sarkar (1982), Sengupta and Nanda (1999), Bagnoli and Bergstrom (2005)), the 
following corollary is an immediate consequence of Theorems 4.1 and 4.2. 



x < 1, 

X ^ 1, 



Corollary 4.1 Assume that F(x) has finite second moment. 

(i) If F(x) is a log-concave d.f. (in particular, if F(x) has either a log-concave or a 
non-increasing (in its interval support) density f(x)), then the partial maxima BLUE 
and the partial maxima BLIE of 62 are non-negative. 

(ii) If F(x) has either a log-concave or a non-increasing (in its interval support) 
density f(x) then it belongs to the NCP class. 

Sometimes it is asserted that "the distribution of a log-convex density is log- 
concave" (see, e.g., Sengupta and Nanda (1999), Proposition 1(e)), but this is not 
correct in its full generality, even if the corresponding r.v. X is non-negative. For 

example, let Y ~ Weibull with shape parameter 1/2, and set X — Y\Y < 1. Then 
X has density / and d.f. F given by 

exp(-Vl - x) exp(- v / l - x) - e _1 

f( x ) = —, — 1V , 1 Fix) = — r , < x < 1, 

V ; 2(l-e- 1 ) v ^^' V ; 1-e- 1 

and it is easily checked that log / is convex in J = (0, 1), while F is not log-concave in 
J. However, we point out that if sup J = +00 then any log-convex density, supported 
on J, has to be non-increasing in J and, therefore, its distribution is log-concave in 
J. Examples of log-convex distributions having a log-convex density are given by 
Bagnoli and Bergstrom (2005). 



5 Consistent estimation of the scale parameter 

Through this section we always assume that F(x), the d.f. that generates the location- 
scale family, is non-degenerate and has finite second moment. The main purpose is 
to verify consistency for L 2 , applying the results of section 4. To this end, we firstly 
state and prove a simple lemma that goes through the lines of Lemma 2.1. Due to 
the obvious fact that MSE[T 2 ] ^ Var[L 2 ], all the results of the present section apply 
also to the BLIE of 9 2 . 
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Lemma 5.1 If F(x) belongs to the NCP class then 
(i) 

Z^k=i " l kl b k 

where m k = E[Z k ] is the k-th component of the vector m and si = Skk = Var [Z k ] is 

the k-th diagonal entry of the matrix S. 

(ii) The partial maxima BLUE, L 2 , is consistent if the series 

OO o 

wr 



k=i 



Proof: Observe that part (ii) is an immediate consequence of part (i), due to the 
fact that, in contrast to the order statistics setup, m k and s\ do not depend on the 
sample size n. Regarding (i), consider the linear unbiased estimator 

n— 1 n n—l 

1 ^ m k d 2 m k 

° n k=l ^ C n k=1 * k 

where c n = Ylk=l m l/ s k- Since F(x) belongs to NCP and the weights of U 2 are 
positive, it follows that the variance of U 2 , which is greater than or equal to the 
variance of L 2 , is bounded by the RHS of (5.1); this completes the proof. □ 

The proof of the following theorem is now immediate. 

Theorem 5.1 If F(x) belongs to the NCP class and if there exists a finite constant 
C and a positive integer k such that 



E[Z, 



k"E 2 [Z k ] 

then 



< C, for all k > fc , (5.3) 



Var[L 2 ] < O , as n ->■ oo. (5.4) 

\ log 71/ 



Proof: Since for k ^ k , 



m l m l 



4 nz 2 k]-mi nzi\ 1 'ck-v 

E 2 [Z k ] 

the result follows by (5.1). □ 

Thus, for proving consistency of order 1 / log n into NCP class it is sufficient 
to verify (5.3) and, therefore, we shall investigate the quantities m k = E[Z fc ] and 
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E[Z|] = s\ + m\. A simple application of Lemma 2.2, observing that = /ik+i — ^k 
and sf = <j1+\ — 2o"fe,fc+i + erf, shows that 



/oo 
F k (x)(l-F(x)) dx, 
-oo 



(5.5) 



E 2 [Z fe ] = 2 J J 

nzi\ = 2 



F\x){l - F{x))F\y){\ - F(y)) dy dx, (5.6) 



oo<a;<y<oo 




F k {x){l-F{y))dy dx. 



(5.7) 



oo<a;<y<oo 



Therefore, all the quantities of interest can be expressed as integrals in terms of the 
(completely arbitrary) d.f. F(x) (cf. Jones and Balakrishnan (2002)). 

For the proof of the main result we finally need the following lemma and its 
corollary. 



Lemma 5.2 (i) For any t > — 1, 

lim k 1+t [ u k (l- U y du = T(l+t)>0. 

k ^°° Jo 



(5.8) 



(ii) For any t with ^ t < 1 and any a > 0, there exist positive constants C\, C2, 
and a positive integer ko such that 



< d < A; 1+t (logA;) a / 1 ; du < C 2 < 00, for all k ^ k , (5.9) 

Jo L a (u) 

where L{u) = — log(l — u). 

Proof: Part (i) follows by Stirling's formula. For part (ii), with the substitution 
u — 1 — e~ x , we write the integral in (5.9) as 



■ /-oo 



,_ x exp(-tx) 



dx 



x u 



k+1 



■E 



exp(— tT) 



where T has the same distribution as the maximum of k+1 i.i.d. standard exponential 
r.v.'s. It is well-known that E[T] = 1 _1 + 2 _1 + • • • + (k + l) -1 . Since the second 
derivative of the function x — > e~ tx /x a is x~ a ~ 2 e~ tx (a + (a + tx) 2 ), which is positive 
for x > 0, this function is convex, so by Jensen's inequality we conclude that 



A: 1+ *(logA;) a f 
Jo 



1 u k (l-u) t , k 
{L{u)Y 



log k 



fc+1 V1 + 1/2H hl/(fc + l) 



x exp 



1 



fc+1 



log k 



and the RHS remains positive as k — > 00, since it converges to e 7 ', where 7 = 
.5772. . . is Euler's constant. This proves the lower bound in (5.9). Regarding the 
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upper bound, observe that the function g(u) = (1 — uY/L a (u), u G (0, 1), has second 
derivative 

g"{u) = { La+ > [t(l - t)L\u) + a(l - 2t)L(u) - a(a + 1)], < u < 1, 

and since ^ £ < 1, a > and L(u) — > +oo as u — > 1— , it follows that there exists a 
constant b G (0, 1) such that g(u) is concave in (6, 1). Split now the integral in (5.9) 
in two parts, 

and observe that for any fixed s > and any fixed b G (0, 1), 

* 7o ^ a («) Jo L-{u) 

because the last integral is finite and independent of k. Therefore, k 1+t {\ogk) a I k is 
bounded above if k 1+t (log k) a lj?\b) is bounded above for some b < 1. Choose b close 
enough to 1 so that g(u) is concave in (b, 1). By Jensen's inequality and the fact that 
1 — b k+1 < 1 we conclude that 

4 2) W = -j^- 1 h{u)g{u) du = -j—j-EMv)] < _^[E(V)], 

where V is an r.v. with density /&(«) — (k + l)w fc /(l — b k+1 ), for u G (6, 1). Since 
E(V) = ((A; + l)/(Jfc + 2))(1 - b k+2 )/{\ - b k+1 ) > (k + l)/(k + 2), and g is positive 
and decreasing (its first derivative is g'(u) = —(1 — u)^ 1 L~ a ~ x (u)(tL(u) + a) < 0, 
< u < 1), it follows from the above inequality that 



fc / log A; \ a ( k 



k+ 1 \log(fc + 2) / V^ + 2 



C 1. 



This shows that A; 1+ '(log k) a lf\b) is bounded above, and thus, k 1+t (\ogk) a Ik is 
bounded above, as it was to be shown. The proof is complete. □ 

Corollary 5.1 (i) Under the assumptions of Lemma 5.2(i), for any b G [0,1) there 
exist positive constants A\, A 2 and a positive integer ko such that 

< A 1 < k l+t J u k (l - uf du< A 2 < +oo, for all k > k . 

(ii) Under the assumptions of Lemma 5.2(ii), for any b G [0, 1) there exist positive 
constants A s , A 4 and a positive integer ko such that 

j-^ ' du< A A < oo, for all k ^ k . 
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Proof: The proof follows from Lemma 5.2 in a trivial way, since the corresponding 
integrals over [0, b] are bounded above by a multiple of b k ~ a , of the form Ab h ~ a , with 
A < +00 being independent of k. □ 

We can now state and prove our main result: 

Theorem 5.2 Assume that F(x) lies in NCP ; and let oj = uj(F) be the upper end- 
point of the support of F, i.e., u = inf{x G K : F(x) = 1}, where u> = +00 if 
F(x) < 1 for all x. Suppose that \im x ^ u> _ F(x) = 1, and that F(x) is dijferentiable 
in a left neighborhood (M,u) of oj, with derivative f(x) = F'(x) for x G (M,oj). For 
5 G R and 76M, define the (generalized hazard rate) function 

L W = L fe 5, 7 ;F) = (i _ F(i))7( ^ (i _ FM))< , *<=(*,„), (5.10) 

and set 

= L*(6, 7; F) = liminf L(x; 5, 7, F), L* = L*(S, 7; F) = limsup L(x; 5, 7, F). 

// either (i) /or some 7 < 3/2 and 5 = 0, or (ii) for some 5 > and some 7 mf/i 
1/2< 7 <1, 

< L*(6,r,F) < L*(5, r ,F) < +00, (5.11) 

t/ien £/ie partial maxima BLUE L 2 (given by (2.2) or (4.1)) 0/ t/ie sca/e parameter 62 
is consistent and, moreover, Var[L 2 ] ^ 0(1/ log n). 

Proof: First observe that for large enough x < u, (5.11) implies that f(x) > 
(L*/2)(l - F(x)) 7 (-log(l - F(x))) s > 0, so that F(x) is eventually strictly in- 
creasing and continuous. Moreover, the derivative f(x) is necessarily finite since 
f(x) < 2L*(1 - F(x)) 7 (-log(l - F(x))) 5 . The assumption \im x ^_F(x) = 1 now 
shows that F _1 (n) is uniquely defined in a left neighborhood of 1, that F(F -1 (m)) = u 
for u close to 1, and that lim^^ F _1 (w) = u. This, in turn, implies that F _1 (n) is 
differentiable for u close to 1, with (finite) derivative (F~ l {u))' = 1/ f(F~ 1 (u)) > 0. 
In view of Theorem 5.1, it suffices to verify (5.3), and thus we seek for an upper 
bound on E[Z|] and for a lower bound on E[Z fe ]. Clearly, (5.3) will be deduced if 
we shall verify that, under (i), there exist finite constants C 3 > 0, C 4 > such that 

k 3 ~ 2 m[Zl] < 3 and k 2 ^M[Z k ] > C 4 , (5.12) 

for all large enough k. Similarly, (5.3) will be verified if we show that, under (ii), 
there exist finite constants C 5 > and C e > such that 

A; 3 - 27 (logA;) 25 E[^] ^ C 5 and fc 2 - 7 (logife)*E[Z fc ] ^ C 6 , (5.13) 

for all large enough k. Since the integrands in the integral expressions (5.5)-(5.7) 
vanish if x or y lies outside the set {x G R : < F(x) < 1}, we have the equivalent 
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E[Z k ] = / F k (x)(l- F(x)) dx, (5.14) 

J a 



expressions 



nzl] = 2 / F k (x)(l-F(y))dydx, (5.15) 

J J a<x<y<ui 

where a (resp., ou) is the lower (resp., the upper) end-point of the support of F. 
Obviously, for any fixed M with a < M < u and any fixed s > 0, we have, as in the 
proof of Lemma 5.2(h), that 



• M 

?ki 



lim k" / F k {x){l - F(x)) dx = 0, 
lim k s / / F k (x)(l- F(y)) dy dx = 0, 

Ja Jx 



because F(M) < 1 and both integrals (5.14), (5.15) are finite for k — 1, by the 
assumption that the variance is finite ((5.15) with k = 1 just equals to the variance 
of F; see also (2.9) with % = 1). Therefore, in order to verify (5.12) and (5.13) 
for large enough k, it is sufficient to replace E[Z fc ] and E[Zjjj], in both formulae 
(5.12), (5.13), by the integrals F k (x)(l-F(x))dx and J£ g F k (x)(l-F(y))dydx, 
respectively, for an arbitrary (fixed) M G (a,u). Fix now M G (cx,u) so large that 
f(x) = F'(x) exists and it is finite and strictly positive for all x G (M,u), and 
make the transformation F(x) — u in the first integral, and the transformation 
(F(x),F(y)) = (u,v) in the second one. Both transformations are now one-to-one 
and continuous, because both F and F^ 1 are differentiable in their respective intervals 
(M, ou) and (F(M), 1), and their derivatives are finite and positive. Since F~ 1 (-u) — > u 
as u — y 1—, it is easily seen that (5.12) will be concluded if it can be shown that for 
some fixed b < 1 (which can be chosen arbitrarily close to 1), 

1 u k ( f 1 1 -v 



holds for all large enough k. Similarly, (5.13) will be deduced if it will be proved that 
for some fixed b < 1 (which can be chosen arbitrarily close to 1), 

^(log,)-/ 1 7 ^(/ 1 7 ^^), tt < C 5 and (5,8) 

k 2 ^(\og k) s f f, { r~ U l du 2 C e , (5.19) 

holds for all large enough k. The rest of the proof is thus concentrated on showing 
(5.16) and (5.17) (resp., ((5.18) and (5.19)), under the assumption (i) (resp., under 
the assumption (ii)). 
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Assume first that (5.11) holds under (i). Fix now b < 1 so large that 
L 



, ? (1 - F(x)y < f(x) < 2L*(1 - F(x))\ for all x e w); 



equivalently, 



2L* /(F-i(«)) L 
Due to (5.20), the inner integral in (5.16) is 



< }L-u ^ < 4-i for all m G (6,1). (5.20) 



r 1 l-v 

Ju f(F-i(v) 



By Corollary 5.1(i) applied for t = 2 — 27 > —1, the LHS of (5.16) is less than or 
equal to 



for all k ^ fc , with C3 = 4A 2 L~ 2 (2 — 7) -1 < 00, showing (5.16). Similarly, using the 
lower bound in (5.20), the integral in (5.17) is 

so that, by Corollary 5.1(i) applied for t = 1 — 7 > —1, the LHS of (5.17) is greater 
than or equal to 

p-7 f 1 A-, 



2L* 



J u k (l - u) 1 ^ du^^>0, for all k ^ k , 



showing (5.17). 

Assume now that (5.11) is satisfied under (ii). As in part (i), choose a large 
enough b < 1 so that 

1 < il -"»;?'"> < A, for all u € (6, 1), (5.21) 



2L« /(F-'fu)) L 
where = — log(l — it). Due to (5.21), the inner integral in (5.18) is 

1 (l-vf-i (1-vVLHv) , 2 ^(l-v) 1 ^ , 2(1 -m) 2 ^ 

dv ^ — — — \ — dv < 



because (1 — u) 1 " 1 / L 5 (u) is decreasing (see the proof of Lemma 5.2(h)). By Corollary 
5.1(h) applied for t = 2 — 27 e [0, 1) and a = 25 > 0, the double integral in (5.18) is 
less than or equal to 

1 u k (l - uf- 2 ^ (1 - u)^L 5 (u) , 4 f 1 u k (l - uf- 2 ^ , C 5 

du < — / ^ r ,, . . ait ^ 



L* y 6 L 25 (iz) f{F-^(u)) L 2S (u) ^ A; 3 ^ (log A;) 2(5 ' 
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for all k ^ ko, with C5 = AA^L^ 2 < 00, showing (5.18). Similarly, using the lower 
bound in (5.21), the integral in (5.19) is 



1 u k (l - u) 1 ^ (1 - U yL s (u) 1 f 1 u k (l - u) 1 ^ 

du ^5 ~ / f. . ~ du^ 



L\u) f(F-i(u)) ^2L*J b L*(u) 



and thus, by Corollary 5.1(h) applied for t — 1 — 7 G [0, 1) and a = 5 > 0, the LHS 
of (5.19) is greater than or equal to 



Z" 1 u k (l 

Jb ~ 



2L* J b L s (u) 2L 
showing (5.19). This completes the proof. □ 

Remark 5.1 Taking L(u) = — log(l — u), the limits and L* in (5.11) can be 
rewritten as 

L*(5, T ,F) = hminf /(F ^ )} = ( limsup(F- 1 H) / (l - U yL s (u)) , 
u^i- (l-upL°{u) \ u -+i- J 

L*(5, T ,F) = hmsup /( ^ )} = (lhnin{(F-\u)y(l-uyL s (u)) \ 
u-+i- (i.—upL°(u) \ti-n- J 

In the particular case where F is absolutely continuous with a continuous density / 
and interval support, the function f{F^ 1 {u)) = l/(F~ 1 (-u)) / is known as the density- 
quantile function (Parzen, (1979)), and plays a fundamental role in the theory of 
order statistics. Theorem 5.2 shows, in some sense, that the behavior of the density- 
quantile function at the upper end-point, u—1, specifies the variance behavior of the 
partial maxima BLUE for the scale parameter 2 . In fact, (5.11) (and (6.1), below) 
is a Von Mises-type condition (cf. Galambos (1978), §§2.7, 2.11). 

Remark 5.2 It is obvious that condition lim a ,_ ! . a; _ F(x) = 1 is necessary for the 
consistency of BLUE (and BLIE). Indeed, the event that all partial maxima are 
equal to 00(F) has probability po = F(u) — F(u—) (which is independent of n). 
Thus, a point mass at x — 00(F) implies that for all n, P(L 2 = 0) ^ po > 0. This 
situation is trivial. Non-trivial cases also exist, and we provide one at the end of next 
section. 



6 Examples and conclusions 

In most commonly used location-scale families, the following corollary suffices for 
concluding consistency of the BLUE (and the BLIE) of the scale parameter. Its 
proof follows by a straightforward combination of Corollary 4.1(h) and Theorem 5.2. 

Corollary 6.1 Suppose that F is absolutely continuous with finite variance, and 
that its density f is either log-concave or non-increasing in its interval support 
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J = (a(F),u(F)) = {x G R : < F(x) < 1}. //, either for some 7 < 3/2 and 
5 = 0, or for some 5 > and some 7 with 1/2 < 7 ^ 1, 

f(x) 

lim 7 r,/ XX / i / =7 — TTTT = L G (0, +00), (6.1) 

rr-Kj(F)- (1 — F(x)) 7 (— log(l — F(x))) S V ' V ' 

£/ien t/ie partial maxima BLUE of the scale parameter is consistent and, moreover, 
its variance is at most of order 0(1/ logn). 

Corollary 6.1 has immediate applications to several location-scale families. The fol- 
lowing are some of them, where (6.1) can be verified easily. In all these families 
generated by the distributions mentioned below, the variance of the partial maxima 
BLUE L 2 (see (2.3) or (4.1)), and the mean squared error of the partial maxima 
BLIE T 2 (see (2.5) or (4.2)) of the scale parameter is at most of order 0(1/ logn), as 
the sample size n — > 00. 

1. Power distribution (Uniform). F(x) = x x , f(x) = Xx* -1 , < x < 1 (A > 0), 

and 00(F) = 1. The density is non-increasing for A ^ 1 and log-concave for A ^ 1. It 
is easily seen that (6.1) is satisfied for 5 = 7 = (for A = 1 (Uniform) see section 3). 

2. Logistic distribution. F(x) = (1 + e~ x )~ l , f(x) = e~ x (l + e~ x )~ 2 , x G R, and 
ou(F) = +00. The density is log-concave, and it is easily seen that (6.1) is satisfied 
for 5 = 0, 7 = 1. 

3. Pareto distribution. F(x) = 1 — x~ a , f(x) = ax~ a ~ l , x > 1 (a > 2, so that the 
second moment is finite), and 00(F) = +00. The density is decreasing, and it is easily 
seen that (6.1) is satisfied for 5 = 0, 7 = 1 + 1/a. Pareto case provides an example 
which lies in NCP and not in NCS class - see Bai, Sarkar & Wang (1997). 

4. Negative Exponential distribution. F(x) = f(x) = e x , x < 0, and oj(F) = 0. 
The density is log-concave and it is easily seen that (6.1) is satisfied for 5 = 7 = 0. 
This model is particularly important, because it corresponds to the partial minima 
model from the standard exponential distribution - see Samaniego and Whitaker 
(1986). 

5. Weibull distribution (Exponential). F(x) = 1 — e~ x , f(x) = cx c ~ l exp(— x c ), 
x > (c > 0), and 00(F) = +00. The density is non-increasing for c ^ 1 and log- 
concave for c ^ 1, and it is easily seen that (6.1) is satisfied for 5 = 1 — 1/c, 7 = 1. 
It should be noted that Theorem 5.2 does not apply for c < 1, since 5 < 0. 

6. Gumbel (Extreme Value) distribution. F(x) = exp(— e~ x ) = e x f(x), x G K, 

and 00(F) = +00. The distribution is log-concave and (6.1) holds with 7 = 1, 5 — 
(L = 1). This model is particularly important for its applications in forecasting 
records, especially in athletic events - see Tryfos and Blackmore (1985). 

7. Normal Distribution. f(x) = ip(x) = (2ne x2 )- 1 / 2 , F = $, x G R, and 00(F) = 
+00. The density is log-concave and Corollary 6.1 applies with 5 = 1/2 and 7 = 1. 
Indeed, 

lim, V{X) - lim V{X) 



+00 (l-$(x))(-log(l-<I>(a;))) 1 /2 +oo x(l - $(x)) (-log(l -^(a;))) 1 ^' 
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and it is easily seen that 

.. (fix) , V X2 

hm — — - = 1, hm — - — ; — - = 2, 

+00 x (l - ' +00 - log(l - $(x)) 

so that L = \[2~. 

In many cases of interest (especially in athletic events), best performances are 
presented as partial minima rather than maxima; see, e.g., Tryfos and Blackmore 
(1985). Obviously, the present theory applies also to the partial minima setup. The 
easiest way to convert the present results for the partial minima case is to consider 
the i.i.d. sample —X±, . . . , — X n , arising from F_x(x) = 1 — Fx(— x— ), and to observe 
that min{Xi, . . . , Xj} = — max{— X±, . . . , — Xj}, i — 1, . . . , n. Thus, we just have to 
replace F(x) by 1 — F(— x— ) in the corresponding formulae. 

There are some related problems and questions that, at least to our point of 
view, seem to be quite interesting. One problem is to verify consistency for the 
partial maxima BLUE of the location parameter. Another problem concerns the 
complete characterizations of the NCP and NCS classes (see Definition 4.1), since 
we only know S/BSW-type sufficient conditions. Also, to prove or disprove the non- 
negativity of the partial maxima BLUE for the scale parameter, outside the NCP 
class (as well as for the order statistics BLUE of the scale parameter outside the NCS 
class). 

Some questions concern lower variance bounds for the partial maxima BLUEs. 
For example we showed in section 3 that the rate 0(1/ logn) (which, by Theorem 5.2, 
is just an upper bound for the variance of L2) is the correct order for the variance of 
both estimators in the Uniform location-scale family. Is this the usual case? If it is so, 
then we could properly standardize the estimators, centering and multiplying them 
by (logn) 1 / 2 . This would result to limit theorems analogues to the corresponding 
ones for order statistics - e.g., Chernoff, Gastwirth & Johns (1967); Stigler (1974); 
- or analogues to the corresponding ones of Pyke (1965), (1980), for partial maxima 
spacings instead of ordinary spacings. However, note that the Fisher-Information 
approach, in the particular case of the one-parameter (scale) family generated by the 
standard Exponential distribution, suggests a variance of about W^/ (logn)^ for the 
minimum variance unbiased estimator (based on partial maxima) - see Hoffman and 
Nagaraja (2003, eq. (15) on p. 186). 

A final question concerns the construction of approximate BLUEs (for both loca- 
tion and scale) based on partial maxima, analogues to Gupta's (1952) simple linear 
estimators based on order statistics. Such a kind of approximations and/or limit 
theorems would be especially useful for practical purposes, since the computation of 
BLUE via its closed formula requires inverting an n x n matrix. This problem has 
been partially solved here: For the NCP class, the estimator £7 2 , given in the proof 
of Lemma 5.1, is consistent for 9 2 (under the assumptions of Theorem 5.2) and can 
be computed by a simple formula if we merely know the means and variances of the 
partial maxima spacings. 

Except of the trivial case given in Remark 5.2, above, there exist non-trivial 
examples where no consistent sequence of unbiased estimators exist for the scale 
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parameter. To see this, we make use of the following result. 



Theorem 6.1 (Hofmann and Nagaraja 2003, p. 183) Let X%,X%, . . . ,X* be an i.i.d. 

sample from the scale family with distribution function F(x; 62) = F(x/02) (6*2 > 
is the scale parameter) and density f(x;02) = f(x/02)/02, where f(x) is known, it 
has a continuous derivative f, and its support, J(F) = {x : f(x) > 0}, is one of the 
intervals (—00,00), (— 00, 0) or (0, +00). 

(i) The Fisher Information contained in the partial maxima data X^ :1 ^ X% :2 ^ • • • ^ 

X n:n is 9 iven b V 



(ii) The Fisher Information contained in the partial minima data X{. A ^ X{. 2 ^ • • • ^ 
X±. n is given by 

°2 XT[ Jj(F) V f{x) 1 - F{x) J 

It is clear that for fixed 62 > 0, I™ 3 * and 7™ m both increase with the sample size 
n. In particular, if J(F) = (0, 00) then, by Beppo-Levi's Theorem, J™ in converges 
(as n — > 00) to its limit 



J min = 1 p( x ) 1 + ±L^±l - Xfl ( x ) + x 2 fi 2 (x) (X(x) + n{x)) ) dx, (6.2) 



where X(x) = f(x)/(l — F(x)) and //(#) = f(x)/F(x) is the failure rate and re- 
verse failure rate of /, respectively. Obviously, if I mm < +00, then the Cramer-Rao 
inequality shows that no consistent sequence of unbiased estimators exists. This, 
of course, implies that in the corresponding scale family, any sequence of linear (in 
partial minima) unbiased estimators is inconsistent. The same is clearly true for the 
location-scale family, because any linear unbiased estimator for 9 2 in the location- 
scale family is also a linear unbiased estimator for 9 2 in the corresponding scale 
family. 

In the following we show that there exist distributions with finite variance such 
that 7 min in (6.2) is finite: Define s = e~ 2 and 



F(x) 



0, x ^ 0, 

< x ^ s, 



1 — log(x) 
1 — (ax 2 + bx + c)e~ x , x ^ s, 



where 



^-exp(e" 2 )(18 - 6e 2 + e 4 ) ~ 0.599, 
54 
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b = -J^exp(-2 + e- 2 )(9-12e 2 + 2e 4 ) ~ -0.339, 

c = — a eX p(-4 + e" 2 ) (18 - 42e 2 + 43e 4 ) ~ 0.798. 
54 



Noting that F(s) = 1/3, F'(s) = e 2 /9 and F"(s) = -e 4 /27, it can be easily verified 
that the corresponding density 

1 

< x ^ s, 



f(x)=l x{l -log(x)) 2 ' 

[ (ax 2 + (b — 2a)x + c — b)e~ x , x ^ s, 

is strictly positive for x G (0, oo), processes finite moments of any order, and has 
continuous derivative 

log(ar) 

< x ^ s, 



/'(x) = <J x 2 (l-log(x)) 3 ' 

(ax 2 + (b - 4a)x + 2a - 2b + c)e~ x , x ^ s. 

Now the integrand in (6.2), say S(x), can be written as 

1 - 2 log(x) < 
S(x) = <J x(-log(x))(l-log(x)) 3 ' < X ^ S ' 
A(x)+.B(x), x ^ s, 

where, as x — >■ +oo, A(x) ~ Ax 4 e~ :E and 5(x) ~ Bx 6 e~ 2x , with A, i? being positive 
constants. Therefore, S(x)dx = J 2 °° ^ffjfcdy = log(3/2) - 5/18 ~ 0.128. Also, 
since S(x) is continuous in [s, +oo) and S(x) ~ Ax A e~ x as x — >■ +oo, it follows that 
r s °° S(x)dx < +oo and J min is finite. 

Numerical integration shows that S(x)dx ~ 2.77 and thus, I mm ~ 2.9/9% < 
3/#2- In view of the Cramer- Rao bound this means that, even if a huge sample of 
partial minima has been recorded, it is impossible to construct an unbiased scale 
estimator with variance less than 0f/3. Also, it should be noted that a similar 
example can be constructed such that f"(x) exists (and is continuous) for all x > 0. 

Of course the above example can be adapted to the partial maxima case by 
considering the location-scale family generated by the distribution function 



F(x) = < 



with s, a, b and c as before. 



(ax 2 — bx + c)e x , x ^ — s, 
- log(-x) 



, , / v -s^x<0, 
1 — log(— X) 

1, x^0, 
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